A mutation-centric approach to identifying pharmacogenomic relations in text

نویسندگان

  • Bastien Rance
  • Emily Doughty
  • Dina Demner-Fushman
  • Maricel G. Kann
  • Olivier Bodenreider
چکیده

OBJECTIVES To explore the notion of mutation-centric pharmacogenomic relation extraction and to evaluate our approach against reference pharmacogenomic relations. METHODS From a corpus of MEDLINE abstracts relevant to genetic variation, we identify co-occurrences between drug mentions extracted using MetaMap and RxNorm, and genetic variants extracted by EMU. The recall of our approach is evaluated against reference relations curated manually in PharmGKB. We also reviewed a random sample of 180 relations in order to evaluate its precision. RESULTS One crucial aspect of our strategy is the use of biological knowledge for identifying specific genetic variants in text, not simply gene mentions. On the 104 reference abstracts from PharmGKB, the recall of our mutation-centric approach is 33-46%. Applied to 282,000 abstracts from MEDLINE, our approach identifies pharmacogenomic relations in 4534 abstracts, with a precision of 65%. CONCLUSIONS Compared to a relation-centric approach, our mutation-centric approach shows similar recall, but slightly lower precision. We show that both approaches have limited overlap in their results, but are complementary and can be used in combination. Rather than a solution for the automatic curation of pharmacogenomic knowledge, we see these high-throughput approaches as tools to assist biocurators in the identification of pharmacogenomic relations of interest from the published literature. This investigation also identified three challenging aspects of the extraction of pharmacogenomic relations, namely processing full-text articles, sequence validation of DNA variants and resolution of genetic variants to reference databases, such as dbSNP.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Using PharmGKB to train text mining approaches for identifying potential gene targets for pharmacogenomic studies

The main objective of this study was to investigate the feasibility of using PharmGKB, a pharmacogenomic database, as a source of training data in combination with text of MEDLINE abstracts for a text mining approach to identification of potential gene targets for pathway-driven pharmacogenomics research. We used the manually curated relations between drugs and genes in PharmGKB database to tra...

متن کامل

Narrative Vagueness in Grass’s The Tin Drum: A Text-Centric Model of Narration to Reveal Dialogized Heteroglossia

The present study sets out to investigate the narrator’s textual position in Grass’s The Tin Drum. Although authorial self-dramatization through affinities with one or more characters in the work is undeniable, this study mainly concentrates on the inner interpenetrations of heteroglot utterances as uttered by an unreliable first-person narrator, Oskar Matzerath, in the light of the Bakhtinian ...

متن کامل

Identifying Strategies Affecting Iran Public Diplomacy through Sport and Its Consequences

 Today, sport has a close relationship with politics and has become a matter of international relations. Therefore, this study was conducted with the aim of identifying those strategies affecting Iran public diplomacy through sport and its consequences. The present study had a qualitative approach and the required data were collected using semi-structured interviews. The statistical population ...

متن کامل

Pharmacogenomic Profiling of the PI3K/PTEN Pathway in Sporadic Breast Cancer

Background: Pharmacogenomics is the study of genetic variations among individuals to predict the probability that a patient will respond to single or multidrug chemotherapy. Breast cancer is one of the most common cancers among women worldwide. Treatment of breast cancer by application of biological rationales gives us the ability to match the correct pharmacology to individual tumour genetic p...

متن کامل

Pharmacogenomic approach in type 2 diabetes treatment

Introduction: Type 2 diabetes (T2D) is chronic health caused by the interaction between genetic and environmental factors that results in high blood glucose. The evidence-based guidelines for diabetes management are mainly based on lifestyle changes, control of risk factors, and the management of blood glucose levels. Although numerous antidiabetic agents have been developed over time, T2D trea...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • Journal of biomedical informatics

دوره 45 5  شماره 

صفحات  -

تاریخ انتشار 2012